Text Categorization from Category Name via Lexical Reference

Authors

  • Libby Barak
  • Ido Dagan
  • Eyal Shnarch
Abstract

Requiring only category names as user input is a highly attractive, yet hardly explored, setting for text categorization. Earlier bootstrapping results relied on similarity in LSA space, which captures rather coarse contextual similarity. We suggest improving this scheme by identifying concrete references to the category name’s meaning, obtaining a special variant of lexical expansion.


Similar resources

Dynamic Categorization of Semantics of Fashion Language: A Memetic Approach

Categories are not invariant. This paper attempts to explore the dynamic nature of semantic category, in particular, that of fashion language, based on the cognitive theory of Dawkins’ memetics, a new theory of cultural evolution. Semantic attributes of linguistic memes decrease or proliferate in replication and spreading, which involves a dynamic development of semantic category. More specific...


Feature-Semantic Gradients in Lexical Categorization Revealed by Graded Manual Responses

Participants performed a categorization task in which basic-level animal names (e.g., cat) were assigned to their superordinate categories (e.g., mammal). Manual motor output was measured by sampling computer-mouse movement while participants clicked on the correct superordinate category label, and not on a simultaneously presented incorrect category. Animal names were selected from the concept-...


Categorizing Local Contexts as a Step in Grammatical Category Induction

Building on the use of local contexts, or frames, for human category acquisition, we explore the treatment of contexts as categories. This allows us to examine and evaluate the categorical properties that local unsupervised methods can distinguish and their relationship to corpus POS tags. From there, we use lexical information to combine contexts in a way which preserves the intended category,...


Categorical Information in Pharmaceutical Terminologies

Drug information sources use category labels to assist in navigating and organizing information. Some category labels describe drugs from multiple perspectives (e.g., both structure and function). The National Drug File - Reference Terminology (NDF RT) is a drug information source that augments a "legacy" categorization system with a formal reference model specifying Chemical Structure, Cellula...


L2 Learners’ Lexical Inferencing: Perceptual Learning Style Preferences, Strategy Use, Density of Text, and Parts of Speech as Possible Predictors

This study was intended first to categorize the L2 learners in terms of their learning style preferences and second to investigate if their learning preferences are related to lexical inferencing. Moreover, strategies used for lexical inferencing and text related issues of text density and parts of speech were studied to determine their moderating effects and the best predictors of lexical infe...



Publication date: 2009